Skip to main content

The Archive Team Just In Time Grabs

The hardest part about our transient, shallow world wide web is the terrifying swiftness in which data disappears. To this end, Archive Team members have often bravely strapped on miner's helmets and flashlights, dove into the flaming wreckage of a dying site, and grabbed a copy for all of time. Some of these rescues, consisting of what we could grab, are being saved here.

Please Note: Some of these items were not burning as brightly or recently as others - they might be merely considered "off-site backups" of sites or collections, but in most cases the original data is now gone.


PART OF
Archive Team
More right-solid
More right-solid
More right-solid
SHOW DETAILS
up-solid down-solid
Prior Page
eye
Title
Date Archived
Creator
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 7,901
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1982-01 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 7,890
favorite 0
comment 0
This is a panic download of www.theguardian.com 1999-sitemap-urls as of 2014-05-07.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 7,888
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1981-06 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
This dataset is a collection of scraped public twitter updates used in coordination with an academic project to study the geolocation data related to twittering. From the explanatory PDF in the dataset collection: We provide both training set and test set (collected from September 2009 to January 2010) in the paper You Are Where You Tweet: A Content-Based Approach to Geo-locating Twitter Users in CIKM 2010. The training set contains 115,886 Twitter users and 3,844,612 updates from the users....
Topics: academic paper, twitter, tweets, location, geolocation, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 7,796
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1982-03 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 7,794
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1981-05 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 7,786
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1981-01 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 7,765
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1981-07 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 7,751
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1982-04 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 7,736
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1982-02 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 7,621
favorite 0
comment 0
This is a panic download of 2010-04 world articles from theguardian.com as of 2013-09-10.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.joystiq.com
web
eye 7,585
favorite 0
comment 0
This is a panic download of www.joystiq.com images as of 2015-01-29.
Topics: www.joystiq.com, archiveteam
The Archive Team Just In Time Grabs
by www.atlus.com
web
eye 7,519
favorite 0
comment 0
This is a panic download of www.atlus.com/forum/ as of 2013-11-10. NOTE: My first grab was incomplete. This should be complete.
Topics: www.atlus.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 7,327
favorite 0
comment 0
This is a panic download of www.theguardian.com 1998-sitemap-urls as of 2014-05-07.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by medium.com
web
eye 7,313
favorite 0
comment 0
This is a panic download of medium.com sitemap-posts-2016-04-15 as of 2017-01-14.
Topics: medium.com, archiveteam
The Archive Team Just In Time Grabs
by catholiceducation.org
web
eye 7,184
favorite 0
comment 0
This is a panic download of catholiceducation.org as of 2013-07-12.
Topics: catholiceducation.org, archiveteam
The Archive Team Just In Time Grabs
by www.engadget.com
web
eye 6,983
favorite 0
comment 0
This is a image dump i create by scanning my 2008 engadget articles dump for all image urls.
Topics: www.engadget.com, engadget images, engadget, archiveteam
The Archive Team Just In Time Grabs
web
eye 6,871
favorite 1
comment 0
A snapshot of isoHunt's Facebook Page before they "self-destructed." The timeline was manually expanded by showing all stories. The HTML and PDF were generated in Firefox in 24.0. The WARC was generated with Qupzilla 1.4.4 and odie5533's WarcMITMProxy.
Topic: archiveteam
The Archive Team Just In Time Grabs
by orteil.dashnet.org
web
eye 6,843
favorite 0
comment 0
This is a panic grab of orteil.dashnet.org as of July 31, 2014.
Topics: archiveteam, cookie clicker, orteil.dashnet.org
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 6,751
favorite 0
comment 0
This is a panic download of 2011-12 world articles from theguardian.com as of 2013-10-01.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.engadget.com
web
eye 6,736
favorite 0
comment 0
This is a mirror of www.engadget.com articles with /2008/ urls in them as of 2012-12-06.
Topics: www.engadget.com, engadget articles, engadget, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 6,710
favorite 0
comment 0
This is a panic download of www.theguardian.com 2000-sitemap-urls as of 2014-05-08.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
web
eye 6,681
favorite 0
comment 0
Topic: archiveteam
The Archive Team Just In Time Grabs
by www.engadget.com
web
eye 6,675
favorite 0
comment 0
This is a image dump i create by scanning my 2009 engadget articles dump for all image urls.
Topics: www.engadget.com, engadget images, engadget, archiveteam
The Archive Team Just In Time Grabs
web
eye 6,671
favorite 0
comment 0
This is images found in my warc.gz dump of the feed as of 2013-01-24. The links are based on a 2013-01-10 dump of the feed but these images were grab on 2013-01-24.
Topics: g4tv.com, thefeed, images, archiveteam
The Archive Team Just In Time Grabs
by theblaze.com
web
eye 6,641
favorite 0
comment 0
This is a panic download of http://theblaze.com/wp-content/* as of 2012-09-03. This is based on data from my theblaze.com stories mirror here below: http://archive.org/details/theblaze.com-stories-20120903-mirror
Topics: theblaze.com, theblaze, glenn beck, archiveteam
The Archive Team Just In Time Grabs
by www.engadget.com
web
eye 6,614
favorite 0
comment 0
This is a mirror of www.engadget.com articles with /2011/ urls in them as of 2012-12-07.
Topics: www.engadget.com, engadget articles, engadget, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 6,581
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-2007-08 as of 2015-06-05.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by torrentfreak.com
web
eye 6,544
favorite 0
comment 0
This is a panic download of torrentfreak.com as of 2013-06-19.
Topics: torrentfreak.com, archiveteam
The Archive Team Just In Time Grabs
by www.engadget.com
web
eye 6,503
favorite 0
comment 0
This is a image dump i create by scanning my 2007 engadget articles dump for all image urls.
Topics: www.engadget.com, engadget images, engadget, archiveteam
The Archive Team Just In Time Grabs
web
eye 6,477
favorite 0
comment 0
pingmag.jp is an art site that just reopened after being closed for years. All the old content was posted online again, so this grab contains all their historical content.
Topics: webcrawl, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 6,442
favorite 0
comment 0
This is a panic download of 2010-11 world articles from theguardian.com as of 2013-09-17.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 6,384
favorite 0
comment 0
This is a panic download of 2000-04 world articles from theguardian.com as of 2013-08-24.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 6,376
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-1982-06 as of 2015-05-24.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 6,354
favorite 0
comment 0
This is a panic download of 2010-02 world articles from theguardian.com as of 2013-09-09.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by kompleteclub.com
web
eye 6,249
favorite 0
comment 0
This is a panic download of kompleteclub.com/forums/ as of 2013-04-02.
Topics: kompleteclub.com, forums, archiveteam
The Archive Team Just In Time Grabs
by www.dailymail.co.uk
web
eye 6,092
favorite 0
comment 0
This is a panic download of www.dailymail.co.uk articles-2012-06-18 as of 2015-11-23. This is based on www.dailymail.co.uk sitemap xml.
Topics: www.dailymail.co.uk, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 6,069
favorite 0
comment 0
This is a panic download of www.theguardian.com 2008-07-sitemap-urls as of 2015-09-16.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 6,063
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-2007-04 as of 2015-06-05.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by arstechnica.com
web
eye 5,944
favorite 0
comment 0
This is a panic download of arstechnica.com images from 2010 as of 2012-12-07. I got this from my articles dump.
Topics: arstechnica.com, arstechnica images, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,937
favorite 0
comment 0
This is a panic download of www.theguardian.com 2007-05-sitemap-urls as of 2014-10-03.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,908
favorite 0
comment 0
This is a panic download of 2008-09 world articles from theguardian.com as of 2013-09-02.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by g4tv.com
web
eye 5,902
favorite 0
comment 0
This is a panic download of all the image urls from my early g4tv.com/games/ dump. This was done on 2013-03-16.
Topics: g4tv.com, games, images, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,893
favorite 0
comment 0
This is a panic download of 2006-11 world articles from theguardian.com as of 2013-08-30.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.regretsy.com
web
eye 5,875
favorite 0
comment 0
This is the image grab from my earlier dump of www.regretsy.com. Turns out that there is are linked to static.regretsy.com and images.regretsy.com but most redirect to www.regretsy.com. Only www.regretsy.com urls was grabed earlier anyways.
Topics: www.regretsy.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,801
favorite 0
comment 0
This is a panic download of www.theguardian.com 2008-05-sitemap-urls as of 2015-09-16.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by mojang.com/notch
web
eye 5,744
favorite 0
comment 0
This is a panic grab of mojang.com/notch as of September 12, 2014.
Topics: archiveteam, mojang, notch, minecraft, mojang.com, mojang.com/notch
The Archive Team Just In Time Grabs
by www.engadget.com
web
eye 5,739
favorite 0
comment 0
This is a mirror of www.engadget.com articles with /2009/ urls in them as of 2012-12-07.
Topics: www.engadget.com, engadget articles, engadget, archiveteam
The Archive Team Just In Time Grabs
web
eye 5,737
favorite 0
comment 0
This is a grab of thersa.org from 2013-10-07. It was a test run for the ArchiveBot.
Topics: archiveteam, webcrawl, archivebot
The Archive Team Just In Time Grabs
by g4tv.com
web
eye 5,713
favorite 0
comment 0
This is a panic download of the video pages from g4tv.com/videos/ as of 2013-03-09. This includes comment pages of the posted videos on g4tv. This also includes descriptions of the videos posted.
Topics: g4tv.com, g4tv, video pages, archiveteam
The Archive Team Just In Time Grabs
by www.engadget.com
web
eye 5,638
favorite 0
comment 0
This is a mirror of www.engadget.com articles with /2010/ urls in them as of 2012-12-07.
Topics: www.engadget.com, engadget articles, engadget, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,583
favorite 0
comment 0
This is a panic download of www.theguardian.com 2007-10-sitemap-urls as of 2014-10-05.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
web
eye 5,525
favorite 0
comment 0
This is a panic download of http://katproxy.com/community/ images as 2013-08-06.
Topics: katproxy.com, kickass torrents, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,426
favorite 0
comment 0
This is a panic download of 2009-01 world articles from theguardian.com as of 2013-09-05.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 5,390
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-2007-01 as of 2015-06-05.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.engadget.com
web
eye 5,309
favorite 0
comment 0
This is a mirror of www.engadget.com articles with /2005/ urls in them as of 2012-12-06.
Topics: www.engadget.com, engadget articles, engadget, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,298
favorite 0
comment 0
This is a panic download of www.theguardian.com 2008-02-sitemap-urls as of 2014-10-06.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,282
favorite 0
comment 0
This is a panic download of 2008-11 world articles from theguardian.com as of 2013-09-04.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.nytimes.com
web
eye 5,264
favorite 0
comment 0
This is a panic download of www.nytimes.com sitemap-urls-2007-09 as of 2015-06-05.
Topics: www.nytimes.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,248
favorite 0
comment 0
This is a panic download of www.theguardian.com 2008-03-sitemap-urls as of 2014-10-06.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,222
favorite 0
comment 0
This is a panic download of www.theguardian.com 2007-09-sitemap-urls as of 2014-10-05.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,216
favorite 0
comment 0
This is a panic download of 2009-09 world articles from theguardian.com as of 2013-09-07.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,214
favorite 0
comment 0
This is a panic download of 2008-08 world articles from theguardian.com as of 2013-09-02.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by store.steampowered.com
web
eye 5,180
favorite 0
comment 0
This is a panic download of store.steampowered.com app-10-to-300000 as of 2013-12-15.
Topics: store.steampowered.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,179
favorite 0
comment 0
This is a panic download of www.theguardian.com 2007-01-sitemap-urls as of 2014-09-29.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,109
favorite 0
comment 0
This is a panic download of 2011-11 world articles from theguardian.com as of 2013-10-01.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,077
favorite 0
comment 0
This is a panic download of 2010-03 world articles from theguardian.com as of 2013-09-09.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,036
favorite 0
comment 0
This is a panic download of www.theguardian.com 2008-06-sitemap-urls as of 2015-09-16.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.isoc.org
web
eye 5,009
favorite 0
comment 0
This is a panic download of www.isoc.org as of 2012-09-24. www.isoc.org redirects mostly to www.internetsociety.org now.
Topics: www.isoc.org, internet society, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 5,003
favorite 0
comment 0
This is a panic download of www.theguardian.com 2008-04-sitemap-urls as of 2015-02-27.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 4,968
favorite 0
comment 0
This is a panic download of www.theguardian.com 2007-03-sitemap-urls as of 2014-10-02.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 4,959
favorite 0
comment 0
This is a panic download of www.theguardian.com 2008-08-sitemap-urls as of 2015-09-16.
Topics: www.theguardian.com, archiveteam
The Archive Team Just In Time Grabs
by kat.ph/blog/
web
eye 4,947
favorite 0
comment 0
This is a panic download of kat.ph/blog/. This is a bittorent/pirate website. The blog part of kat.ph is what i backup. I also did a image dump of all images on this blog. The image dump is also in warc.gz format too.
Topics: kat.ph/blog/, kickasstorrents, archiveteam
The Archive Team Just In Time Grabs
by martinmanleylifeanddeath.com
web
eye 4,907
favorite 0
comment 0
This a panic download of martinmanleylifeanddeath.com as 2013-08-16.
Topics: martinmanleylifeanddeath.com, archiveteam
The Archive Team Just In Time Grabs
by www.theguardian.com
web
eye 4,870
favorite 0
comment 0
This is a panic download of www.theguardian.com 2007-11-sitemap-urls as of 2014-10-06.
Topics: www.theguardian.com, archiveteam